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(54) Data processing method and apparatus 

(57) Binary data to which meta-data is to be 
appended, and meta-data to be appended to the binary 
data are loaded. The meta-data has an XML format. 
The loaded meta-data is connected after the loaded 
binary data. The data synthesized in this manner is out- 
put as a single file as a whole. In this file data, since 
header information remains the same as that before 
connection of the meta-data, binary data can be repro- 
duced by an existing browser or the like. A binary file 
with meta-data can be discriminated by checking if data 
In a correct XML format is present at the end of data. 
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Description 

FIELD OF THE INVENTION 

[0001] The present invention relates to a data s 
processing method and apparatus, which process 
binary data and meta-data, and a storage medium. 
[0002] The present invention also relates to a data 
processing method and apparatus, which process 
image data and area information that pertains to the 10 
image data, and a storage medium. 

BACKGROUND OF THE INVENTION 

[0003] Meta-data is "data pertaining to data", and is is 
used as data that explains binary data such as image 
data, audio data, and the like. However, when binary 
data and corresponding meta-data are present as inde- 
pendent files, the user must manage the binary data 
and meta-data at the same time upon moving or copy- 20 
ing files, resulting in inconvenience. 
[0004] In general, in order to facilitate management 
of binary data and meta-data, various methods that 
describe binary data and meta-data have been pro- 
posed. Such conventional techniques can be catego- 25 
rized into methods of defining new binary formats, and 
methods that manage those data using databases. 
[0005] As examples of the method of defining a new 
binary format, Tiff, Exit, Flashpix, and the like are known 
as new image formats. Fig. 24 shows the concept of a 30 
format in which meta-data is embedded in binary data. 
As binary data, for example, image data is used. As 
shown in Fig. 24, a field for describing meta-data is nor- 
mally allocated in the header field of an image, and the 
user describes meta-data in that field. By describing 35 
meta-data in such way, data can be easily searched and 
classified. Since binary data includes meta-data, they 
can be managed by a single file, thus allowing relatively 
easy file management 

[0006] A method of managing binary data and 40 
meta-data using a database will be explained below. 
Fig. 25 shows the concept of the method of managing 
binary data and meta-data using a database. The 
method of managing binary data and meta-data, which 
are present as independent files, using a database or 45 
the like, as shown in Fig. 25, is also prevalent. In this 
case, existing binary data can be used by an existing 
application without any modifications. However, the 
method of defining a new format that describes meta- 
data, and the method of managing meta-data using a so 
database suffer their respective problems. 
[0007] When a new format that describes meta- 
data is defined, existing binary data must be converted 
into the new format, and meta-data must be described 
in that new format. Furthermore, in order to search 55 
binary data using the meta-data in the new format, an 
application compatible to the new format is required. 
That is, in order to describe and use meta-data, a large 
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number of steps and dedicated environment are 
required. Also, In order to process binary data In such 
new format (e.g., to output an Image if the binary data is 
Image data), an application compatible to that format is 
required, and an existing application cannot cope with 
this situation. 

[0008] In addition, since the description method of 
meta-data is uniquely determined in the new format, a 
new search routine must be created to create an appli- 
cation that uses meta-data in the new format. Further- 
more, in order to describe meta-data in a new field, the 
specifications of the format must be changed. 
[0009] On the other hand, when binary data and 
meta-data are simultaneously managed using a data- 
base, meta-data can be neither registered nor used if 
database software is not available. Also, in order to dis- 
play registered meta-data, dedicated software is 
required. Furthermore, when binary data is read out 
from the database, meta-data does not come with it, 
i.e., binary data without meta-data is formed. 
[0010] The aforementioned problems are also 
posed when, for example, image data is included as 
binary data, and area information that pertains to an 
image expressed by that image data is included as 
meta-data. 

[0011] Area information is data that pertains to a 
partial image obtained by extracting an area which has 
a uniform feature from a still image. This area informa- 
tion is used to detect and recognize objects in an image. 
However, when no area information is appended to still 
image data, a still image must undergo area segmenta- 
tion as a pre-process for detection and recognition of 
objects in an Image. Since area segmentation generally 
requires huge computer resources and long processing 
time, the efficiency is poor. 

[0012] For this reason, it is demanded to manage 
such area information in correspondence with still 
image data, as described above. However, when stili 
image data and corresponding area information are 
present as Independent files, the user must manage the 
still image data and area information at the same time 
upon moving or copying files, resulting in inconven- 
ience. 

[0013] Conventionally, no attempt to manage still 
image data and its area information, as described 
above, has been made. However, the method of manag- 
ing these two data in association with each other can be 
generally classified into a method of defining a new 
image format and a method of managing those data 
using a database, as has been explained for binary data 
and meta-data. 

[001 4] As examples of the method of defining a new 
binary format for stili image data, Tiff, Ex'rf, Flashpix, and 
the like are known as new image formats. Fig. 26 shows 
the concept of a format in which meta-data is embedded 
in binary data. That Is, Fig. 26 shows the concept of a 
format in which area information is embedded in still 
image data. In this manner, when a field for describing 
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area Information is allocated fn the header field of an 
Image, and area Information that pertains to the image 
is stored in that field, they can be managed as a single 
file, thus allowing relatively easy file management. 
[0015] On the other hand, image data and area 5 
information are managed using a database as follows. 
Fig. 27 shows the concept of a method of managing still 
image data and area information using a database. As 
shown in Fig. 27, a method of managing still image data 
and area information which are present as independent 10 
files using a database or the like is available. In this 
case, still image data is not modified at all, and existing 
still image data can be directly used by an existing appli- 
cation. 

[0016] However, the method of defining a new for- is 
mat that describes area Information, and the method of 
managing meta-data using a database respectively suf- 
fer the same problems as those of management of 
binary data and meta-data. 

[0017] That is, when a new format that describes 20 
area information is defined, existing still image data 
must be converted into the new format, and area infor- 
mation must be described in that new format Further- 
more, in order to use area information in the new format, 
an application compatible to the new format is required. 25 
That is, in order to describe and use area information, a 
large number of steps and dedicated environment are 
required. Also, in order to process still image data In 
such new format (e.g., to output an image if it Is image 
data), an application compatible to that format is 30 
required, and an existing application cannot cope with 
such situation. 

[0018] tn addition, since the description method of 
area information Is uniquely determined in the new for- 
mat, a new read routine of area information must be ere- 35 
ated to create an application that uses area information 
in the new format Furthermore, in order to describe 
area information in a new field, the specifications of the 
format must be changed. 

[001 9] On the other hand, when still Image data and ao 
area Information are simultaneously managed using a 
database, area Information can be neither appended 
nor used if database software is not available. Further- 
more, when only still image data is read out from the 
database, area information does not come with it, i.e., as 
still image data without area information is formed. 

SUMMARY OF THE INVENTION 

[0020] The present invention has been made in so 
consideration of the aforementioned problems posed in 
description and search of meta-data, and has as Its 
object to allow registration of meta-data in binary data 
without any influences on existing applications. 
[0021] It is another object of the present invention to 55 
provide binary data registered with meta-data in a for- 
mat that can be processed by an existing application. 
[0022] It is still another object of the present inven- 
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tbn to allow use of existing tools for a data description 
language by describing meta-data using a general data 
description language, so as to facilitate development of 
a compatible application. 

[0023] It Is still another object of the present inven- 
tion to extract meta-data from binary data In which the 
meta-data is described, so as to be able to use it tn 
processes such as search, reference, change, and the 
like. 

[0024] It Is still another object of the present inven- 
tion to allow more accurate discrimination of the pres- 
ence/absence of meta-data by registering confirmation 
information such as a check sum together with meta- 
data, and discriminating the meta-data using the confir- 
mation information. 

[0025] It is still another object of the present inven- 
tion to allow easy discrimination of binary data regis- 
tered with meta-data by registering meta-data and its 
size in binary data. 

[0026] It is still another object of the present inven- 
tion to allow appending of area Information to still Image 
data without any Influences on existing applications. 
[0027] It is still another object of the present inven- 
tion to provide still image data registered with area infor- 
mation In a format that can be processed by an existing 
application. 

[0028] It is still another object of the present inven- 
tion to allow use of existing tools for a data description 
language by describing area information using a gen- 
eral data description language, so as to facilitate devel- 
opment of a compatible application. 
[0029] It is still another object of the present inven- 
tion to extract area information from still image data in 
which the area information is described, so as to be able 
to use it in various processes. 

[0030] Other features and advantages of the 
present invention will be apparent from the following 
description taken in conjunction with the accompanying 
drawings, in which like reference characters designate 
the same or similar parts throughout the figures thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0031] The accompanying drawings, which are 
incorporated In and constitute a part of the specification, 
illustrate embodiments of the invention and, together 
with the description, serve to explain the principles of 
the Invention. 

Fig. 1 is a block diagram showing the arrangement 
of a data processing apparatus according to the 
first embodiment of the present invention; 
Fig. 2 is a flow chart for explaining a meta-data reg- 
istration process according to the first embodiment; 
Fig. 3 is a view for explaining the registration state 
of meta-data in binary data according to the first 
embodiment; 

Fig. 4 is a flow chart showing the discrimina- 
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tton/extraction sequence of the registered meta- 
data according to the second embodiment of the 
present invention; 

Rg. 5 Is a flow chart for explaining details of a meta- 
data discrimination process according to the sec- 5 
ond embodiment; 

Fig. 6 shows an example of the data format of 
binary data in which XML data is registered as 
meta-data; 

Rg. 7 shows an example of binary data registered io 
with meta-data which is described in a correct XML 
format; 

Rg. 8 is a flow chart for explaining a meta-data reg- 
istration process according to the third embodiment 
of the present invention; 15 
Rg. 9 is a view for explaining the registration state 
of meta-data in binary data according to the third 
embodiment; 

Fig. 10 is a flow chart for explaining details of a 
meta-data discrimination process according to the 20 
fourth embodiment of the present invention; 
Fig. 1 1 shows an example of the data format of 
binary data in which XML data is registered as 
meta-data in the fourth embodiment; 
Rg. 12 is a flow chart for explaining a meta-data 25 
registration process according to the fifth embodi- 
ment of the present invention; 
Rg. 13 is a view for explaining the registered state 
of meta-data in binary data according to the fifth 
embodiment; 30 
Rg. 14 is a flow chart for explaining details of a 
meta-data discrimination process according to the 
sixth embodiment of the present invention; 
Fig. 15 shows an example of the data format of 
binary data in which XML data is registered as 35 
meta-data in the sixth embodiment; 
Rg. 16 is a flow chart for explaining an appending 
process of area information according to the sev- 
enth embodiment of the present invention; 
Rg. 1 7 is a view for explaining the registered state ao 
of area information in still image data according to 
the seventh embodiment; 

Rg. 18 is a flow chart showing the discrimina- 
tion/extraction sequence of the registered area 
information according to the eighth embodiment of 45 
the present invention; 

Rg. 19 Is a flow chart for explaining details of an 
area information discrimination process according 
to the eighth embodiment; 

Rg. 20 shows an example of the data format of still so 

image data in which SVG data is appended as area 

information in the sixth embodiment; 

Rg. 21 is a flow chart for explaining a process for 

generating area information in a data format 

according to the 1 0th embodiment of the present 55 

invention; 

Rg. 22A shows an example of an image seg- 
mented into areas; 
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Rg. 22B Is a table showing an example of the data 
format of an area Information list according to the 
10th embodiment, which corresponds to the seg- 
mented areas shown in Rg. 22A; 
Fig. 23 is a view for explaining a data format of area 
Information according to the ninth embodiment of 
the present Invention; 

Rg. 24 shows the concept of a data format in which 
meta-data Is embedded In binary data by a general 
method; 

Rg. 25 shows the concept of a general method of 
managing binary data and meta-data using a data- 
base; 

Rg. 26 shows the concept of a format in which area 
information as meta-data is embedded in still image 
data as binary data; and 

Fig. 27 shows the concept of a method of managing 
still image data and area information using a data- 
base. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[0032] Preferred embodiments of the present inven- 
tion will now be described in detail in accordance with 
the accompanying drawings. 

[First Embodiment] 

[Q033] Fig. 1 is a block diagram showing the 
arrangement of a data processing apparatus according 
to the first embodiment Referring to Fig. 1, reference 
numeral 100 denotes a reader, which reads an image 
using, e.g., a scanner device. Reference numeral 101 
denotes an input unit which is used by the user to input 
an instruction and data, and includes a keyboard and 
pointing device. Reference numeral 102 denotes a stor- 
age unit which stores binary data and meta-data. A hard 
disk is normally as the storage unit 102. Reference 
numeral 103 denotes a display which displays binary 
data stored in the storage unit 1 02, and image data read 
by the reader 100. A CRT or liquid crystal display device 
is normally used as the display 103. 
[0O34] Reference numeral 104 denotes a CPU 
which is Involved in all processes of the aforementioned 
building components, and a ROM 105 and RAM 106 
provide programs, data, or a work area required for 
such processes to the CPU 1 04. Note that a control pro- 
gram that implements the processing sequence of this 
embodiment (to be described later with reference to Rg. 
2) Is also stored in the ROM 105. Of course, that control 
program may be stored in the storage unit 1 02, and may 
be loaded onto the RAM 1 06 upon being executed by 
the CPU 104. 

[0035] Note that the data processing apparatus of 
the first embodiment has various other building compo- 
nents in addition to those described above, but they are 
not the gist of the present invention, and a detailed 
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description thereof will be omitted. 
[0036] A process for registering meta-data in binary 
data in the data processing apparatus with the afore- 
mentioned arrangement will be explained below. Fig. 2 
is a flow chart for explaining the meta-data registration 
process according to the first embodiment 
[0037] Referring to Fig. 2, binary data designated 
by the user is loaded onto a memory (RAM 1 06) in step 

51301. For example, the binary data is designated by 
inputting a desired binary data file name at the key- 
board, or indicating an icon of the corresponding binary 
data using the pointing device (e.g., a mouse), in step 

51302, an XML file which is designated by the user and 
describes meta-data is loaded onto the memory (RAM 
106). This XML file Is also designated by inputting a file 
name at the keyboard or Indicating the corresponding 
icon by the pointing device (e.g., a mouse). 

[0038] It Is checked in step S1303 if the XML file 
that describes meta-data is XML data in a correct for- 
mat. The correct format is discriminated by checking if 
the description format of an XML file is satisfied (e.g., if 
the right and left parentheses of tags correctly form 
pairs, if a tag assignment format is correct, and so 
forth). Note that it is also checked if XML data is correct 
as well as if it has a correct format. Whether or not XML 
data is correct can be determined by checking, e.g., if 
XML data is described in accordance with a schema 
such as DTD (Document Type Definition) or the like. 
[0039] If it is determined in step S1303 that the XML 
file Is not XML data in the correct format, the flow 
advances to step S1305. In step S1305, an XML data 
error is displayed on the display 103, thus ending this 
process. 

[0040] On the other hand, If it is determined in step 
Si 303 that the XML file is XML data in the correct for- 
mat, the flow advances to step S1304. In step S1304, 
meta-data is registered by connecting that meta-data to 
the end of the binary data loaded onto the memory in 
step S1301. After that, the binary data registered with 
the meta-data is output in step S1306, thus ending the 
process. Note that data having a data structure shown 
in Fig. 3 is stored as one file in the storage unit 1 02 upon 
outputting data in step S1306. 

[0041] Fig. 3 Is a view for explaining the registration 
state of meta-data in binary data according to the first 
embodiment. As shown in Fig. 3, by connecting meta- 
data described in XML data to the end of binary data, 
meta-data can be registered without any Influences on 
other applications. More specifically, since information 
in the header field of binary data remains the same as 
that before connection of meta-data, if binary data is, 
e.g., image data, an image can be reproduced by a con- 
ventional browser (connected meta-data is ignored). 
[0042] Furthermore, since meta-data is descrfced 
in XML, if a tool that can Interpret XML data is available, 
meta-data can be added, changed, and referred to, 
when the XML data contents are extracted, thus assur- 
ing very high compatibility. Note that extraction of XML 
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data contents will be described in detail in the second 
embodiment 

[0043] As described above, according to the first 
embodiment, when meta-data is described in XML and 

5 that XML data is connected to the end of binary data, 
meta-data can be registered in existing binary data. 
[0044] More specifically, according to the first 
embodiment, by connecting meta-data which is 
described in a correct format in a predetermined data 

10 description language to the end of binary data, meta- 
data can be registered to existing binary data without 
any influence on existing applications. That is, binary 
data registered with meta-data can be provided in a for- 
mat that can be processed by an existing application. 

15 When an existing data description language is used as 
that for describing meta-data, existing tools of that data 
description language can be used upon edit, reference, 
and the like of meta-data, thus saving extra efforts for 
development 

20 

[Second Embodiment] 

[0045] In the first embodiment, the method of regis- 
tering meta-data to binary data has been explained. In 

25 the second embodiment, a process for discriminating 
whether or not meta-data is registered to binary data, 
and extracting the meta-data if it is registered wifl be 
explained. Note that the arrangement of the data 
processing apparatus in the second embodiment is the 

30 same as that in the first embodiment (Fig. 1), and a 
detailed description thereof will be omitted. 
[0046] The processes for discriminating whether or 
not meta-data that has been explained in the first 
embodiment is registered In data of a designated file, 

35 and extracting the registered meta-data will be 
explained below. Fig. 4 is a flow chart showing the dis- 
crimination/extraction sequence of registered meta- 
data according to the second embodiment. In this 
embodiment, the extracted meta-data is displayed on 

ao the display 1 03, but the present invention is not limited 
to such specific output mode. For example, it Is known 
to those who are skilled in the art that the extracted 
meta-data may be provided to a search process. 
[0047] Referring to Fig. 4, a file which is to undergo 

45 discrimination as to whether or not meta-data is regis- 
tered, i.e., data to be processed, is designated by a user 
instruction in step S501 . The data to be processed is 
designated in step S501 by inputting the file name of the 
binary data at the keyboard or indicating a correspond- 
so ing icon using the pointing device (mouse). 

[0048] It is discriminated in step S502 if meta-data 
described in XML is registered in data of the designated 
file. Details of the discrimination process in step S502 
will be explained below with reference to the flow chart 

55 In Fig. 6 and a schematic view in Fig. 6. Fig. 5 is a flow 
chart for explaining details of the meta-data discrimina- 
tion process according to the second embodiment Fig. 
6 shows an example of the data format of binary data in 
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which XML data Is registered as meta-data. 
[0049] As has been explained in the first embodi- 
ment, the data format of the data to be processed In 
which XML data as meta-data is registered is as shown 
in Fig. 6. Hence, the presence/absence of meta-data is 
discriminated as follows. 

[0050] As shown in Fig. 5, the entire data of the file 
designated in step S501 (the entire data to be proc- 
essed) is loaded onto the memory (RAM 1 06) in step 
S1601. Since the data output in step S1306 in the first 
embodiment is managed as a single file, the entire data 
can be read out by a general file management system. 
[0051] It Is checked in step S1602 If a character 
string "</PhotoXML>" is present at the end of data loaded 
in step S1601. If such character string is not present, 
the flow advances to step S1 605. 
[0052] On the other hand, if the character string 
VPhotoXMD" is present at the end of the loaded data to 
be processed, the flow advances to step S1603. ft is 
checked in step S1603 if a character string "(Pho- 
toXML) u is present before the character string VPho- 
toXML)", and it is also confirmed if data bounded by 
these character strings is described in a correct XML 
format. At this time, it may be further checked if that data 
is correct XML data Whether or not the data has a cor- 
rect format and is correct can be discriminated, as has 
been explained in the first embodiment (step S1303). 
[0053] If it is confirmed in step S1 603 that the data 
has a correct format, the flow advances to step S1 604. 
It is concluded in step S1604 that meta-data is regis- 
tered, and this process ends. On the other hand, if a 
correct format is not confirmed in step S1603, the flow 
advances to step SI 605. It is concluded in step SI 605 
that meta-data is not registered. That is, if it is deter- 
mined in step S1602 that the character string VPho- 
toXML)" is not present at the end of the binary data of 
interest, if it is determined in step S1 603 that the char- 
acter string "(PhotoXML)" is not present, or if it is deter- 
mined in step S1603 that described contents bounded 
by the character strings are incorrect, the flow advances 
to step S1605 to determine that no meta-data is regis- 
tered in the data to be processed. 
[0054] In this manner, discrimination of meta-data 
ends. 

[0055] Referring back to the flow chart in Fig. 4, if it 
is concluded by the process shown in the flow chart in 
Fig. 5 that meta-data is registered, the flow advances to 
step S503. In step S503, the contents of the registered 
meta-data are displayed on the basis of XML data 
bounded by the character strings "(PhotoXML)" and 
VPhotoXML) 0 , thus ending the process. On the other 
hand, if it is determined in step S502 that no meta-data 
is registered, the process ends. 
[0056] Fig. 7 shows an example of binary data with 
meta-data, I.e., in which meta-data described in a cor- 
rect XML format is registered. The meta-data shown in 
Fig. 7 records information such as "photographer", 
"date", "title", "keyword", and the like of an Image. 



Therefore, by discriminating meta-data in the aforemen- 
tioned sequence, a search and the like can be made 
using such information. Furthermore, Information such 
as "shutter speed", "exposure time", and the like upon 
5 capturing an image can be recorded in the meta-data, 
and an image can be automatically corrected using 
these pieces of information. 

[0057] As described above, according to the sec- 
ond embodiment, binary data with meta-data, and nor- 
10 mal binary data can be discriminated by checking if 
XML data is described at the end of data in a correct for- 
mat If meta-data is found, that meta-data can be dis- 
played. 

[0058] More specifically, according to the second 
is embodiment, binary data registered with meta-data can 
be discriminated from that registered with no meta-data, 
and the registered meta-data can be extracted. There- 
fore, when an existing data description language is used 
as that for describing meta-data, existing tools of that 
20 data description language can be used in a search 
process using meta-data, thus saving extra efforts for 
development. 

[0059] In each of the above embodiments, XML 
data is used as meta-data. However, the present inven- 
ts tion is not limited to such specific data. For example, 
other data description languages such as SGML, 
HTML, and the like may be used. Also, still image data, 
moving image data, audio data, and the like can be 
used as binary data. 

30 

[Third Embodiment] 

[0060] In the first embodiment, meta-data in the 
XML format is simply connected to the end of binary 
35 data. In the third embodiment, meta-data and Its check 
sum are appended to the end of binary data. In this 
manner, meta-data can be registered in existing binary 
data without any influences on an existing application, 
and the presence/absence of meta-data can be discrim- 
40 inated more reliably. 

[0061] Note that the arrangement of the data 
processing apparatus in the third embodiment is the 
same as that in the first embodiment (Fig. 1), and a 
detailed description thereof will be omitted. 
45 [0062] Fig. 8 is a flow chart for explaining a meta- 
data registration process according to the third embodi- 
ment. 

[0063] Referring to Fig. 8, binary data designated 
by the user is loaded onto the memory (RAM 106) in 
so step S2301 . In this case, the binary data is designated 
by inputting a desired binary data file name at the key- 
board, or Indicating an icon of the corresponding binary 
data using the pointing device (e.g., a mouse). In step 
S2302, an XML file which is designated by the user and 
55 describes meta-data is loaded onto the memory (RAM 
106). This XML file is also designated by Inputting a file 
name at the keyboard or indicating the corresponding 
icon by the pointing device (e.g., a mouse). 
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[0064] It is checked In step S2303 If the XML file 
that describes meta-data Is XML data In a correct for- 
mat. The correct format Is discriminated by checking if 
the description format of an XML file is satisfied (e.g., if 
the right and left parentheses of tags correctly form 5 
pairs, if a tag assignment format is correct, and so 
forth). Note that it is also checked if XML data is correct 
as well as if it has a correct format. Whether or not XML 
data is correct can be determined by checking, e.g., if 
XML data is described in accordance with a schema 
such as DTD (Document Type Definition) orthe like. 
[0085] If it is determined in step S2303 that the XML 
file is not XML data in the correct format, the flow 
advances to step S2305. tn step S2305, an XML data 
error is displayed on the display 103, thus ending this 
process. 

[0086] On the other hand, if it is determined in step 
S2303 that the XML file is XML data in the correct for- 
mat, the flow advances to step S2304. In step S2304, a 
check sum of the meta-data of interest is computed. As 
the check sum, a numerical value that can be uniquely 
determined from the meta-data (for example, the meta- 
data size (the number of bytes), the number of charac- 
ters, the number of words, the number of lines, the 
number of items bounded by tags, and the like) is used. 
If two or more of these are used at the same time, a 
check sum with higher precision can be obtained. Also, 
the numerical values used as the check sum (or the 
method of computing the check sum) may be described 
as XML data for the meta-data discrimination process to 
be described later in the fourth embodiment. 
[0067] In step S2306, the meta-data and check sum 
are registered at the end of the binary data loaded in 
step S2301. Furthermore, the binary data obtained by 
registering the meta-data is output in step S2307, thus 
ending the process. 

[0068] Fig. 9 is a view for explaining the registered 
state of meta-data in binary data according to this 
embodiment. As shown in Rg. 9, a check sum and 
meta-data described in XML data are connected to the 
end of binary data In this manner, meta-data can be 
registered without any influences on other applications. 
More specifically, assuming that binary data is standard 
JPEG Image data, even when meta-data is appended to 
the end of that image data, an image can be loaded by 
a commercially available image application (that does 
not recognize meta-data) without posing any problems. 
That is, since information in the header field of binary 
data remains the same as that before connection of 
meta-data, if binary data is, e.g., image data, an image 
can be reproduced by a conventional browser (con- 
nected meta-data is ignored). 
[0069] Furthermore, since meta-data is described 
in XML, if a tool that can interpret XML data is available, 
meta-data can be added, changed, and referred to, 55 
when the XML data contents are extracted, thus assur- 
ing very high compatibility. Note that extraction of XML 
data contents registered In this way will be described in 



detail In the fourth embodiment 
[0070] As described above, according to the third 
embodiment, when meta-data Is described in XML and 
that XML data is connected to the end of binary data, 
meta-data can be registered in existing binary data. 
[0071] More specifically, according to the third 
embodiment, by connecting meta-data (data In an XML 
format and check sum) which is described in a correct 
format in a predetermined data description language to 
the end of binary data, meta-data can be registered in 
existing binary data without any influence on existing 
applications. That is, binary data registered with meta- 
data can be provided in a format that can be processed 
by an existing application. When an existing data 
description language is used as that for describing 
meta-data, existing tools of that data description lan- 
guage can be used upon edit, reference, and the like of 
meta-data, thus saving extra efforts for development. 
[0072] In the above embodiment, correctness of 
XML data is checked, but whether or not data Is well- 
formed may be checked instead. Also, the meta-data 
format is not limited to XML, and other formats such as 
SGML, HTML, and the like may be used. 



[0073] in the third embodiment, the method of reg- 
istering meta-data in binary data has been explained. In 
the fourth embodiment, a process for discriminating 
whether or not meta-data is registered to binary data, 
and extracting the meta-data If It Is registered will be 
explained. Note that the arrangement of the data 
processing apparatus in the fourth embodiment is the 
same as that in the first embodiment (Rg. 1), and a 
detailed description thereof will be omitted. 
[0074] The processes for discriminating whether or 
not meta-data that has been explained in the third 
embodiment is registered in data of a designated file, 
and extracting the registered meta-data will be 
explained below. The meta-data determination/extrac- 
tion sequence of the fourth embodiment is the same as 
that shown in Fig. 4 in the second embodiment, and will 
be explained while citing Fig. 4. In this embodiment, the 
extracted meta-data is displayed on the display 1 03, but 
the present invention Is not limited to such specific out- 
put mode. For example, it is known to those who are 
skilled in the art that the extracted meta-data may be 
provided to a search process. 
[0075] Referring to Fig. 4, a file which is to undergo 
discrimination as to whether or not meta-data is regis- 
tered is designated by a user instruction in step S501. 
The file to be processed (data to be processed) is des- 
ignated in step S501 by inputting the file name of the 
binary data at the keyboard or indicating a correspond- 
ing icon using the pointing device (mouse). 
[0076] It is discriminated in step S502 if meta-data 
described in XML is registered in data of the designated 
file. Details of the discrimination process in step S502 
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will be explained below with reference to the flow chart 
In Fig. 10 and a schematic view In Fig. 11. Fig. 10 is a 
flow chart for explaining details of the meta-daia dis- 
crimination process according to the fourth embodi- 
ment. Fig. 11 shows an example of the data format of 
binary data in which XML data is registered as meta- 
data. 

[0077] As has been explained in the third embodi- 
ment the data format of the data to be processed in 
which XML data as meta-data is registered Is as shown 
in Fig. 1 1 . Hence, the presence/absence of meta-data is 
discriminated as follows. 

[0078] As shown I n Fig. 1 0 t the entire data of the file 
designated in step S501 {the entire data to be proc- 
essed) is loaded onto the memory (RAM 106) In step 
S2601 . Since the data output in step S2307 in the third 
embodiment Is managed as a single file, the entire data 
can be read out by a general file management system. 
[0079] It is checked in step S2602 if a character 
string "(/PhotoXML)" is present at the end of data loaded 
in step S2601. If such character string is not present, 
the flow advances to step S2608, and it is concluded 
that no meta-data is registered in that data to be proc- 
essed. 

[0080] On the other hand, if the character string 
"(/PhotoXML)" is present at the end of the loaded data, 
the flow advances to step S2603. It Is checked in step 
S2603 if a character string "(PhotoXML)" is present 
before the character string "(/PhotoXML)", and It is also 
confirmed if data bounded by these character strings is 
described in a correct XML format. At this time, it may 
be further checked if that data Is correct XML data. 
Whether or not the data has a correct format and is cor- 
rect can be discriminated, as has been explained in the 
third embodiment (step S2303). 
[0081] If it is confirmed in step S2603 that the data 
has a correct format, the flow advances to step S2604. 
In step S2604, a check sum stored immediately before 
the character string <PhotoXML) is loaded, and is held 
as check sum 1 . Subsequently, In step S2605 a check 
sum of the meta-data which has been confirmed to have 
the correct format is computed, and is held as check 
sum 2. Note that the check sum is computed by the 
same method as that used in the process upon connect- 
ing meta-data described in the third embodiment. For 
example, when "the number of characters" is used as 
the check sum upon connecting meta-data, "the number 
of characters u Is computed as a check sum in this proc- 
ess. 

[0082] In step S2606, check sums 1 and 2 are com- 
pared, and if they are equal to each other, the flow 
advances to step S2607 to conclude that meta-data is 
registered. On the other hand, if the two check sums are 
not equal to each other, the flow advances to step 
S260B. That is, if It is determined In step S2602 that the 
character string "(/PhotoXML)" Is not present at the end 
of the binary data of Interest, if it is determined in step 
S2603 that the character string "(PhotoXML)" is not 



present, If It ts determined in step S2603 that described 
contents bounded by the character strings are incorrect, 
or if it is determined in step S2606 that the two check 
sums are not equal to each other, the flow advances to 
5 step S2608 to conclude that no meta-data is registered 
in the data to be processed. 

[0083] In this manner, discrimination of meta-data 
ends. 

[0084] Referring back to the flow chart in Fig. 4, if it 

w is concluded by the process shown in the flow chart in 
Fig. 10 that meta-data is registered, the flow advances 
to step S503. In step S503, the contents of the regis- 
tered meta-data are displayed on the basis of XML data 
bounded by the character strings "(PhotoXML)" and 

is "(/PhotoXML)", thus ending the process. On the other 
hand, if it is determined in step S502 that no meta-data 
is registered, the process ends. 
[0085] As described above, according to the fourth 
embodiment, binary data with meta-data, and normal 

20 binary data can be discriminated by checking if XML 
data is described at the end of data in a correct format. 
If meta-data is found, that meta-data can be displayed. 
[0086] More specifically, according to the fourth 
embodiment, binary data registered with meta-data can 

25 be discriminated from that registered with no meta-data, 
and the registered meta-data can be extracted. There- 
fore, when an existing data description language is used 
as that for describing meta-data, existing tools of that 
data description language can be used in a search 

30 process using meta-data, thus saving extra efforts for 
development 

[0087] According to the above embodiment, since a 
check mechanism such as a check sum or the like for 
discriminating meta-data is provided, the meta-data can 

35 be discriminated more accurately. 

[0088] In the above embodiment, XML data is used 
as meta-data. However, the present invention is not lim- 
ited to such specific data. For example, other data 
description languages such as SGML, HTML, and the 

40 like may be used. Of course, the character strings used 
({PhotoXML) and (/PhotoXML) are used in the above 
embodiment) and the way correctness is discriminated 
change depending on the data description language 
used, and are not particularly limited by the above 

45 embodiments. Also, still image data, moving image 
data, audio data, and the like can be used as binary 
data. 

[0089] As has been explained in the third embodi- 
ment, various check sums can be used. On the other 

so hand, a check sum mode (e.g., "the number of charac- 
ters" or the like) may be described in an entity of meta- 
data (in XML data in the above embodiment), and a 
check sum may be computed according to that descrip- 
tion upon data discrimination. 

55 [0090] In the fourth embodiment, the pres- 
ence/absence of meta-data Is discriminated using a 
check sum. Also, by confirming correctness (e.g., valid- 
ity or well-formedness) of XML itself, more accurate dis- 
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crimination can be achieved. 

[Fifth Embodiment] 

[0091] In the third embodiment, a check sum Is reg- 
istered to discriminate the presence/absence of meta- 
data accurately. In this embodiment, the pres- 
ence/absence of meta-data is discriminated more accu- 
rately using size information and a marker code. 
[0092] Note that the arrangement of the data 
processing apparatus in the fifth embodiment is the 
same as that in the first embodiment (Fig. 1), and a 
detailed description thereof will be omitted. 
[0093] Fig. 1 2 is a flow chart for explaining a meta- 
data registration process according to the fifth embodi- 
ment Referring to Fig. 12, binary data designated by 
the user is loaded onto the memory (RAM 106) In step 

53301 . In this case, the binary data is designated by 
inputting a desired binary data file name at the key- 
board, or indicating an icon of the corresponding binary 
data using the pointing device (e.g., a mouse). In step 

53302, an XML file which Is designated by the user and 
describes meta-data is loaded onto the memory (RAM 
106). This XML file is also designated by inputting a file 
name at the keyboard or indicating the corresponding 
icon by the pointing device (e.g., a mouse). 

[0094] It is checked in step S3303 if the XML file 
that describes meta-data is XML data in a correct for- 
mat. The correct format is discriminated by checking if 
the description format of an XML file is satisfied (e.g., if 
the right and left parentheses of tags correctly form 
pairs, if a tag assignment format is correct, and so 
forth). Note that it Is also checked if XML data is correct 
as well as if it has a correct format. Whether or not XML 
data is correct can be determined by checking, e.g., if 
XML data is described in accordance with a schema 
such as DTD (Document Type Definition) or the like. 
[0095] If ft is determined in step S3303 that the XML 
file is not XML data in the correct format, the flow 
advances to step S3305. In step S3305, an XML data 
error is displayed on the display 103, thus ending this 
process. 

[0096] ' On the other hand, if it is determined in step 
S3303 that the XML file is XML data in the correct for- 
mat, the flow advances to step S3304. In step S3304, 
the size of the meta-data of interest (XML data in this 
embodiment) is computed. In step S3306, the meta- 
data loaded in step S3302 and the size information 
computed in step S3304 are connected and registered 
after the end of the binary data loaded in step S3301 via 
a predetermined marker code. Furthermore, the binary 
data obtained by registering the meta-data is output in 
step S3307, and data having a data structure (to be 
described below with reference to Fig. 13) is stored as 
one file in the storage unit 102. 
[0097] The method of registering meta-data in 
binary data according to the fifth embodiment will be 
explained below. Fig. 13 is a view for explaining the reg- 
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Istered state of meta-data in binary data according to 
the fifth embodiment. In the fifth embodiment, as shown 
in Fig. 13, a predetermined 4-byte marker code 
(0x4C4D5850 in this embodiment) is connected to the 
5 end of binary data, meta-data described in XML is con- 
nected thereafter, and 4-byte numerical value data serv- 
ing as size information is connected after that meta- 
data. 

[0098] In this manner, meta-data can be registered 
10 without any influences on other applications. More spe- 
cifically, assuming that binary data is standard JPEG 
image data, even when meta-data is appended to the 
end of that image data, an image can be loaded by a 
commercially available Image application (that does not 
15 recognize meta-data) without posing any problems. 
That is, since information in the header field of binary 
data remains the same as that before connection of 
meta-data, if binary data is, e.g., image data, an image 
can be reproduced by a conventional browser (con- 
20 nected meta-data is ignored). 

[0099] Furthermore, since meta-data Is described 
in XML, if a tool that can interpret XML data is available, 
meta-data can be added, changed, and referred to, 
when the XML data contents are extracted, thus assur- 
es ing very high compatibility. Note that extraction of XML 
data contents will be described in detail in the sixth 
embodiment. 

[0100] As described above, according to the fifth 
embodiment, since predetermined data (marker) is 

30 appended to the head of meta-data (XML data) 
described in a correct format of a predetermined data 
description language, and data indicating the size of the 
meta-data is appended to the end of that meta-data, 
meta-data can be registered to existing binary data 

35 without any influences on existing applications. That is, 
binary data registered with meta-data can be provided 
in a format that can be processed by an existing appli- 
cation. 

[0101] The presence/absence of meta data can be 
40 discriminated more easily and quickly using the size 
Information and marker appended to XML data, and 
meta-data can be extracted. 

[01 02] When an existing data description language 
Is used as that for describing meta-data, existing tools 

45 of that data description language can be used upon 
edit, reference, and the like of meta-data, thus saving 
extra efforts for development. 
[0103] Note that the correctness of XML data is 
checked in this embodiment, but whether or not data is 

so well-formed may be checked instead. Also, the meta- 
data format is not limited to XML, and other formats 
such as SGML, HTML, and the like may be used. Also, 
the size information may be expressed by a tittle- or big- 
endian binary format or a text format. 

55 

[Sixth Embodiment] 

[0104] In the fifth embodiment, the method of regis- 
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terlng meta-data in binary data has been explained. In 
the sixth embodiment, a process for discriminating 
whether or not meta-data is registered to binary data, 
and extracting the meta-data if it is registered will be 
explained. Note that the arrangement of the data 5 
processing apparatus in the sixth embodiment is the 
same as that in the first embodiment (Fig. 1 ), and a 
detailed description thereof will be omitted. 
[0105] The processes for discriminating whether or 
not meta-data that has been explained in the fifth 10 
embodiment is registered in data of a designated file, 
and extracting the registered meta-data will be 
explained below. The meta-data determination/extrac- 
tion sequence of the sixth embodiment is the same as 
that shown in Fig. 4 in the second embodiment, and will 15 
be explained while applying Fig. 4. In this embodiment, 
the extracted meta-data is displayed on the display 103, 
but the present invention is not limited to such specific 
output mode. For example, it is known to those who are 
skilled in the art that the extracted meta-data may be 20 
provided to a search process. 
[0106] Referring to Fig. 4, a file which is to undergo 
discrimination as to whether or not meta-data is regis- 
tered (data to be processed) is designated by a user 
instruction in step S501 . The file to be processed is des- 25 
ignated in step S501 by inputting the file name of the 
binary data at the keyboard or indicating a correspond- 
ing icon using the pointing device (mouse). 
[0107] It is discriminated in step S502 if meta-data 
described in XML is registered in data of the designated 30 
file. Details of the discrimination process in step S502 
wilt be explained below with reference to the flow chart 
in Fig. 14 and a schematic view In Rg. 15. Fig. 14 is a 
flow chart for explaining details of the meta-data dis- 
crimination process according to the sixth embodiment 35 
Rg. 15 shows an example of the data format of binary 
data in which XML data is registered as meta-data. 
[0108] As has been explained in the fifth embodi- 
ment, the data format of the data to be processed in 
which XML data as meta-data is registered is as shown 40 
in Fig. 15. In this embodiment, assume that 4 bytes are 
assigned to each of size information and a marker code. 
The presence/absence of meta-data in such binary data 
is discriminated as follows. 

[0109] As shown in Rg. 1 4, the entire data of the file 45 
designated in step S501 is loaded onto the memory 
(RAM 1 06) in step S3601 . Since the data output in step 
S3307 in the fifth embodiment Is managed as a single 
file, the entire data can be read out by a general file 
management system. 50 
[0110] In step S3602, 4 bytes at the end of the 
loaded data to be processed are determined as size 
information of meta-data, and are stored in variable S. If 
meta-data is stored in the format shown In Fig. 15, a 
marker code should be stored in 4 bytes, which have a 55 
head position obtained by adding 8 (bytes) to variable S 
from the end of the data to be processed toward the 
head. Therefore, 4 bytes are read from the (S + 8)-th 



byte from the end of the data to be processed, and are 
compared with a marker code (0x4C4D5850) in step 
S3603. If they are equal to each other, the flow 
advances to step S3604; otherwise, the flow advances 
to step S3607. 

[0111] In step S3604, the presence of the marker 
code Is determined, and S bytes are extracted from the 
(S + 4)-th byte from the end of the data to be processed 
and are determined as meta-data. It Is checked in step 
S3605 if the extracted meta-data follows the rules of the 
description language. For example, if meta-data is 
described in XML, correctness (or well-formedness) of 
its contents is checked. In this case, for example, the 
processes In steps S1602 and S1603 in the second 
embodiment or in steps S2602 and S2603 In the fourth 
embodiment can be applied. If the meta-data follows the 
rules, the flow advances to step S3606 to conclude that 
meta-data is registered. 

[0112] On the other hand, if it is determined in step 
S3603 that 4-byte data read from the (S + 8)-th byte 
from the end of the data to be processed is not equal to 
the marker code (Ox4C4D5850), or if it is determined in 
step S3605 that the contents extracted as meta-data do 
not follow the rules of the description language, the flow 
advances to step S3607. In step S3607, it is concluded 
that no meta-data is registered. 
[0113] In this manner, discrimination of meta-data 
ends. 

[01 14] Referring back to the flow chart in Fig. 4, if it 
is concluded by the process shown in the flow chart In 
Fig. 14 that meta-data Is registered, the flow advances 
to step S503. In step S503, the contents of the regis- 
tered meta-data are displayed on the basis of the S-byte 
data (In this embodiment, data described in XML) 
extracted in step S3604, thus ending the process. On 
the other hand, if it is determined in step S502 that no 
meta-data is registered, the process ends. 
[0115] As described above, according to the sixth 
embodiment, data is extracted from the data to be proc- 
essed on the basis of the size indicated by data stored 
at the end of the data to be processed, and the pres- 
ence/absence of meta-data is determined by checking if 
the extracted data is described in a correct format of the 
description language (e.g., XML). For this reason, meta- 
data can be discriminated and extracted more easily. 
[0116] If the presence of meta-data is determined, 
that meta-data can be displayed. 
[0117] According to the sixth embodiment, binary 
data registered with meta-data can be discriminated 
from that registered with no meta-data, and the regis- 
tered meta-data can be extracted. Therefore, when an 
existing data description language is used as that for 
describing meta-data, existing tools of that data descrip- 
tion language can be used in a search process using 
meta-data, thus saving extra efforts for development. 
[0118] In each of the above embodiments, XML 
data is used as meta-data However, the present inven- 
tion Is not limited to such specific data. For example, 
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other data description languages such as SGML, 
HTML, and the like may be used. Also, still image data, 
moving image data, audio data, and the like can be 
used as binary data. 

[0119] The fifth embodiment inserts a marker code s 
indicating the position of meta-data immediately before 
m eta-data, and the sixth embodiment extracts meta- 
data using that marker code. However, the marker code 
is not always required, but may be omitted. If no marker 
code is inserted, marker code discrimination in step 10 
S3603 can be merely skipped in the sixth embodiment, 
thus obtaining the same discrimination result. However, 
in such case, the load on the discrimination routine 
which determines If no mela-data is present increases. 

[Seventh Embodiment] 

[0120] Each of the first to sixth embodiments 
described above allows to append meta-data to binary 
data without any influences on existing applications. In 20 
the seventh embodiment, a case will be described 
below wherein still image data is used as binary data, 
and data representing area information of the stili image 
data is used as meta-data. 

[0121] Note that the arrangement of the data 25 
processing apparatus in the seventh embodiment is the 
same as that in the first embodiment (Fig. 1), and a 
detailed description thereof will be omitted. The storage 
unit 102 especially stores still Image data and area 
information. 30 
[0122] Fig. 16 Is a flow chart for explaining a proc- 
ess for appending area information according to the 
seventh embodiment 

[0123] Referring to Fig. 16, still image data desig- 
nated by the user is loaded onto the memory (RAM 1 06) 35 
in step S4301 . For example, the still image data is des- 
ignated by inputting a desired still image data file name 
at the keyboard, or indicating an icon of the correspond- 
ing still image data using the pointing device (e.g., a 
mouse). In step S4302, an SVG file which is designated aq 
by the user and describes area information is loaded 
onto the memory (RAM 106). This SVG file is also des- 
ignated by inputting a file name at the keyboard or indi- 
cating the corresponding icon by the pointing device 
(e.g., a mouse). as 
[0124] it is checked in step S4303 if the SVG file 
that describes area information is correct SVG data. 
The correct format is discriminated by checking if the 
description format of an SVG file is satisfied (e.g., if the 
right and left parentheses of tags correctly form pairs, If so 
a tag assignment format is correct, and so forth). 
[0125] If it Is determined in step S4303 that the SVG 
file is not correct SVG data, the flow advances to step 
S4305. In step S4305, an SVG data error is displayed 
on the display 1 03, thus ending this process. 55 
[0126] On the other hand, if it is determined In step 
S4303 that the SVG file is correct SVG data, the flow 
advances to step S4304. In step S4304, area informa- 
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tlon is appended by connecting data in that SVG file 
(area Information) to the end of the still Image data 
loaded onto the memory In step S4301. After that, the 
still image data registered with the area information is 
output In step S4306, thus ending the process. Note 
that data having a data structure shown in Fig. 17 is 
stored as one file in the storage unit 102 upon outputting 
data in step S4306. 

[0127] Fig. 17 is a view for explaining the registra- 
tion state of area information in still image data accord- 
ing to this embodiment. As shown in Fig. 17, by 
connecting SVG data serving as area information to the 
end of still image data, area information can be regis- 
tered without any influences on other applications. More 
specifically, since information in the header field of still 
image data remains the same as that before connection 
of area information, an Image can be reproduced by a 
conventional browser (connected area information is 
ignored). 

[0128] Furthermore, since area information is SVG 
data, if a tool that can interpret SVG data is available, 
area information can be added, changed, and referred 
to, when the SVG data contents are extracted, thus 
assuring very high compatibility. Note that extraction of 
SVG data contents will be described in detail in the 
eighth embodiment. 

[0129] As described above, according to the sev- 
enth embodiment, when area information which is 
described in a correct format of a predetermined data 
description language is connected to the end of still 
image data, area information can be appended to exist- 
ing still image data. That is, still image data appended 
with area information can be provided in a format that 
can be processed by an existing application. Since the 
area information and still image data can be managed 
as a single file, management of area information is facil- 
itated. When an existing data description language is 
used as that for describing area information, existing 
tools of that data description language can be used 
upon edit, reference, and the like of area information, 
thus saving extra efforts for development. 
[0130] Note that the format of the still image data is 
not particularly limited (e.g., JPEG, GIF, and the like 
may be used). 

[Eighth Embodiment] 

[0131] In the seventh embodiment, the method of 
registering area information in still image data has been 
explained. In the eighth embodiment, a process for dis- 
criminating whether or not area information is registered 
in still image data, and extracting the area information if 
it is registered will be explained. Note that the arrange- 
ment of the data processing apparatus in the eighth 
embodiment is the same as that in the first embodiment 
(Fig. 1), and a detailed description thereof wilt be omit- 
ted. 

[01 32] The processes for discriminating whether or 



EP 1045 315 A2 



11 



BNSDCCID: <EP 1045315A2_I_> 



21 



EF 1 045 315 A2 



22 



not area Information that has been explained In the sev- 
enth embodiment is registered In data of a designated 
file, and extracting the registered area information will 
be explained below. Rg. 18 is a flow chart showing the 
discrimination/extraction sequence of registered area 
information according to the eighth embodiment In this 
embodiment, the extracted area information is dis- 
played on the display 103, but the present invention is 
not limited to such specific output mode. For example, tt 
is known to those who are skilled In the art that the 
extracted area information may be provided to various 
other processes such as recognition of objects in an 
image. 

[0133J Referring to Rg. 18, a file which is to 
undergo discrimination as to whether or not area infor- 
mation is registered, i.e., data to be processed, is desig- 
nated by a user instruction in step S4501. The data to 
be processed is designated in step S4501 by inputting 
the file name of the still image data at the keyboard or 
indicating a corresponding Icon using the pointing 
device (mouse). In this embodiment, in order to discrim- 
inate normal still image data from still image data 
appended with area Information that has been 
explained in the seventh embodiment, the former will be 
referred to as still image data, and the latter will be 
referred to as data to be processed. 
[0134] It is discriminated in step S4502 if area infor- 
mation described in SVG is appended to data of the 
designated file. Details of the discrimination process in 
step S4502 will be explained below with reference to the 
flow chart in Rg. 19 and a schematic view in Rg. 20. 
Rg. 19 is a flow chart for explaining details of the area 
Information discrimination process according to the 
eighth embodiment Rg. 20 shows an example of the 
data format of stilt image data to which SVG data is 
appended as area information. 
[0135] As has been explained in the seventh 
embodiment, the data format of the data to be proc- 
essed in which SVG data as area information Is regis- 
tered is as shown in Rg. 20. Hence, the 
presence/absence of area information is discriminated 
as follows. 

[0136] As shown in Fig. 1 9, the entire data of the file 
designated in step S4501 (the entire data to be proc- 
essed) Is loaded onto the memory (RAM 106) in step 
S4601 . Since the data output in step S4306 in the sev- 
enth embodiment is managed as a single file, the entire 
data can be read out by a general file management sys- 
tem. 

[0137] It is checked in step S4602 If a character 
string VAreaJnfo)" is present at the end of data loaded 
in step S4601. If such character string is not present, 
the flow advances to step S4605. 
[0138] On the other hand, if the character string 
VAreaJnfo) 0 Is present at the end of the loaded data to 
be processed, the flow advances to step S4603. It Is 
checked In step S4603 if a character string "(Areajnfo)" 
is present before the character string VAreaJnfo) 0 . In 



this manner, it is checked if data bounded by the char- 
acter strings "<Area_lnfo>" and 'VAreaJnfo)", i.e., SVG 
data, is present Furthermore, tt is discriminated by 
checking pairing of tags if the data bounded by 

s "(Areajnfo)" and VAreaJnfo) 0 is correct SVG data. 
[0139] In this way, it is checked If the SVG data is 
present at the end of the data to be processed, and is 
correct if the presence and correctness of the SVG 
data are confirmed, the flow advances to step S4604. It 

w is concluded in step S4604 that area information is 
appended (correct SVG data Is present), and this proc- 
ess ends. On the other hand, If it is determined in step 
S4602 or S4603 that no SVG data is present or SVG 
data is incorrect, the flow advances to step S4605. It is 

15 concluded in step S4605 that area information is not 
appended (no correct SVG data is present). 
[0140] In this manner, discrimination of area infor- 
mation ends. 

[0141] Referring back to the flow chart in Fig. 18, if 

20 it is concluded by the process shown in the flow chart in 
Rg. 19 that area information is registered, the flow 
advances to step S4503. In step S4503, the contents of 
the registered area information are displayed on the 
basis of SVG data bounded by the character strings 

25 "(Areajnfo)" and VAreaJnfo)", thus ending the proc- 
ess. On the other hand, if it is determined in step S4502 
that no area information is registered, the process ends. 
[0142] As described above, according to the eighth 
embodiment, still image data with area information, and 

30 normal still image data can be discriminated from each 
other based on SVG rules, and as for the still image 
data appended with the area information, its area infor- 
mation can be used. In this embodiment, contents 
based on SVG data are displayed, but SVG data may be 

35 used for various other purposes. For example, when an 
image is clicked, an area including the clicked point may 
be clipped. Also, SVG data may be used as a kind of 
meta-data upon searching for an area having a specific 
shape or color. 

40 [0143] Since SVG data described in the existing 
data description language Is used as area information, 
existing tools of that data description language can be 
used in a search process using meta-data, thus saving 
extra efforts for development. 

45 [0144] In the above embodiment, SVG data is used 
to describe area information, but the present invention is 
not limited to such specific data type. For example, data 
description languages such as XML, SGML, HTML, and 
the like may be used. 

so [0145] In the seventh and eighth embodiments, 
SVG data as meta-data is appended to the end of still 
image data as binary data. However, as described In the 
third to sixth embodiments, a check sum of SVG data, or 
a marker code and size information of SVG data may be 

55 registered, and reliability upon discriminating the pres- 
ence/absence of SVG data may be improved using such 
information. 
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[Ninth Embodiment] 

[0146] The format of area Information to be 
appended to still image data will be explained below. 
The ninth embodiment will exemplify the data format of 
area Information to be appended to still Image data, In 
which each area is expressed by index information Indi- 
cating an area number. That Is, the index information 
represents a unique area number assigned to each of 
areas obtained by area segmentation. Based on this 
Index information, area information is formed by storing 
area numbers in correspondence with pixels of a stilt 
images. 

[0147] Note that area information corresponding to 
each pixel Is stored in the same manner as color infor- 
mation of a pixel in still image data. That is, color infor- 
mation corresponding to each pixel is replaced by area 
information In the data format. After that, tags (e.g., 
"<AreaJnfo> ,, and VArea_lnfo>°) indicating the start and 
end of area Information are attached before and after 
the obtained area information. 
[0148] Rg. 23 is a view for explaining the data for- 
mat of area information according to the ninth embodi- 
ment. In Rg. 23, areas A1 and A2 are extracted as area 
segmentation results, in such case, the following bitmap 
data can be used as area Information. More specifically, 
a numerical value indicating the absence of area num- 
bers ("0" in this embodiment) Is assigned to pixel data 
corresponding to an area other than areas A1 and A2. 
Area numbers indicating A1 and A2 are respectively 
assigned to pixel data that belong to areas A1 and A2. 
Note that each area may be approximated by a rectan- 
gle, polygon, circle, oval, or the tike. In this manner, by 
holding area numbers as bitmap data, the area number 
of an arbitrary pixel can be obtained. For this reason, 
when an arbitrary area of an image is clicked, all pixels 
having the same area number as that of the pixel Indi- 
cated by the mouse pointer can be selected. Using such 
feature, a function of clipping an arbitrary area can be 
implemented. 

[10th Embodiment] 

[0149] In the 10th embodiment, another embodi- 
ment of the data format of area information will be 
explained. In a method of appending area information to 
still image data In the 10th embodiment, each area is 
expressed by a set of binary bitmap data indicating the 
interior/exterior of an area bounded by a minimum rec- 
tangle, and data indicating the position and size of the 
rectangle. 

[01 50] Fig. 21 is a flow chart for explaining the proc- 
ess for generating area information in the data format 
according to the 10th embodiment. Rg. 22A shows an 
image segmented into areas, and Rg. 22B shows an 
example of the data format of an area information list of 
the 10th embodiment, which corresponds to the seg- 
mented areas shown in Fig. 22A. The following explana- 



tion will be given with reference to these figures. 
[0151] In step S4801 , a still image undergoes area 
segmentation, and unique area numbers are assigned 
as indices to respective areas. Note that area segmen- 

5 tation may be done either automatically or manually. 
Rg. 22A shows a state wherein three areas are 
extracted as area segmentation results, and area num- 
bers (indices) A1 to A3 are assigned to these areas. 
[01 52] It is checked in step S4802 if areas that have 

10 not undergone processes in steps S4603 to S4B06 
remain, and if unprocessed areas are found, the flow 
advances to step S4803. The processes in steps S4803 
to S4806 obtain binary bitmap data, position, and size 
of the area of interest 

is [0153] In step S4803, the maximum and minimum 
values of x- and y-coordinates of the area of interest are 
obtained. In step S4804, a rectangle S which has the 
coordinates obtained in step S4803 as its four corners is 
formed to bound the area of interest in step S4805, 

20 binary bitmap data is obtained by binarlzatlon, i.e., 
checking if pixels are included in the area bounded by 
the rectangle S in step S4804. 
[0154] In step S4806, the coordinate position (rec- 
tangle position) of the upper left vertex, and the x- and 

25 y-sizes (rectangle size) of the rectangle S are computed 
from the coordinates of the four corners of the rectangle 
S, and the obtained rectangle information, rectangle 
size, and binary bitmap data are added to an area infor- 
mation list as area information. 

30 [0155] For example, if the area of interest Is A1, a 
rectangle that circumscribes area A1 is obtained in 
steps S4803 and S4804. in step S4805, binary bitmap 
data is generated by setting "1" In bits indicating pixels 
Included in the rectangle, and "0 a in bits indicating pixels 

35 outside the rectangle. Furthermore, in step S4806 the 
position of the rectangle (the coordinates of a1 ) and size 
(X1 and Y1) are obtained, and are added to the area 
information list together with the aforementioned binary 
bitmap data, as shown in Fig. 22B. 

40 [0156] Such processes are repeated for areas A2 
and A3, and if it is determined In step S4802 that the 
processes are completed for all the areas in the image, 
this process ends. 

[0157] In the aforementioned sequence, Informa- 
45 tlon including a set of binary bitmap data indicating the 
interior/exterior of a minimum rectangular region that 
bounds each segmented area, and data indicating the 
position and size of the rectangle is generated, and is 
stored in the form of list shown in Fig. 22B, thus gener- 
so ating an area list By attaching tags (e.g., "(Areajnfo) 0 
and VAreaJnfo)") to the start and end of the area list, 
area information that can be used in the above embodi- 
ments is generated. In this manner, in the 10th embodi- 
ment, bitmap data Indicating the interior/exterior of the 
55 area is held. For this reason, upon executing processes 
in units of areas, when data Is stored In the format of the 
ninth embodiment, a set of pixels that form an area must 
be acquired by scanning an image. However, when the 
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bitmap data, Indicating the Interior/exterior of the area of 
the 10th embodiment is used, a need for the process for 
acquiring a set of pixels that form the area by scanning 
an image can be obviated. 

[0158] In the 10th embodiment, a rectangle that 
bounds an area is used, but an area may be expressed 
by other polygons or curves. 

[0159] As described above, according to the 
present invention, meta-data can be registered to exist- 
ing binary data without any influence on existing appli- 
cations. That is, binary data registered with meta-data 
can be provided in a format that can be processed by an 
existing application. 

[0160] Also, according to the present invention, 
when a data description language is used to describe 
meta-data, tools for the existing data description lan- 
guage can be used, and a compatible application can 
be easily developed. 

[0161] Furthermore, according to the present inven- 
tion, meta-data can be extracted from binary data in 
which meta-data is described, and can be used in proc- 
esses such as search, reference, change, and the like. 
[0162] Moreover, according to the present inven- 
tion, confirmation information such as a check sum is 
registered together with meta-data, and the pres- 
ence/absence of meta-data is discriminated using this 
confirmation information, thus allowing more accurate 
discrimination of meta-data. 

[0163] In addition, according to the present inven- 
tion, since the size of meta-data is registered to binary 
data together with meta-data, binary data in which 
meta-data is described can be easily discriminated. 
[0164] Furthermore, according to the present Inven- 
tion, area information can be appended to still Image 
data without any influences on existing applications. 
[0165] Moreover, according to the present inven- 
tion, still image data registered with area information 
can be provided in a format that can be processed by an 
existing application. 

[0166] In addition, according to the present inven- 
tion, since a general data description language is used 
to describe the area information, existing tools for that 
data description language can be used, and a compati- 
ble application can be easily developed 
[0167] Also, according to the present invention, 
area information can be easily extracted from still image 
data in which area information is described, and can be 
used in various processes. 

[0168] Furthermore, the present invention can be 
applied to the system comprising either a plurality of 
units or a single unit. It is needless to say that the 
present invention can be applied to the case which can 
be attained by supplying programs which execute the 
process defined by the present system or invention. 
[01 69] As many apparently widely different embodi- 
ments of the present invention can be made without 
departing from the spirit and scope thereof, it is to be 
understood that the Invention is not limited to the spe- 



cific embodiments thereof except as defined in the 
claims. 

[01 70] The computer program disclosed above can 
be obtained in electronic form for example by download- 

s ing the code over a network such as the internet Thus 
in accordance with another aspect of the present inven- 
tion there is provided an electrical signal carrying proc- 
essor implementable instructions for controlling a 
processor to carry out the method as hereinbefore 

10 described. 

Claims 

1 . A data processing method for registering meta-data 
is to binary data, characterized by comprising: 

the first loading step (S1301 , S2301 , S3301) of 
loading binary data to which meta-data is to be 
appended; 

20 the second loading step (S1302, S2303, 

S3302) of loading meta-data to be appended to 
the binary data; 

the connection step (S1304, S2306, S3306) of 
connecting the meta-data loaded in the second 
25 loading step after the binary data loaded in the 

first loading step; and 

the output step (S1306, S2307, S3307) of out- 
putting data obtained in the connection step as 
a single file as a whole. 

30 

2. The method according to claim 1 , further compris- 
ing the judging step (ST 303, S2303, S3303) of 
judging If the meta-data loaded in the second load- 
ing step Is described In a correct format in a prede- 

35 termined data description language, and 

wherein the connection step includes the step of 
connecting the meta-data after the binary data 
when it is determined in the judging step that the 
meta-data is described in the correct format 

40 

3. The method according to claim 2, wherein the judg- 
ing step includes the step of also judging if the 
meta-data satisfies correctness as the predeter- 
mined data description language. 

45 

4. The method according to claim 1 , further compris- 
ing the generation step (S2304) of generating con- 
firmation Information on the basis of the meta-data 
loaded in the second loading step, and 

so wherein the connection step (S2306) includes the 
step of connecting the confirmation information 
generated in the generation step and the meta-data 
loaded In the second loading step after the binary 
data loaded in the first loading step. 

55 

5. The method according to claim 4, wherein the gen- 
eration step includes the step of generating a plu- 
rality of kinds of confirmation information In 
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correspondence with the meta-data loaded in the 
second loading step. 

6. The method according to claim 4, further compris- 
ing the additional description step of additionally 
describing information representing a type of confir- 
mation information used in the generation step in 
the meta-data loaded in the second loading step. 

7. The method according to claim 4, further compris- 
ing the Judging step of Judging if the meta-data 
loaded in the second loading step is described in a 
correct format In a predetermined data description 
language, and 

wherein the generation step and the connection 
step are executed when it is determined in the judg- 
ing step that the meta-data is described in the cor- 
rect format 

8. The method according to claim 7, wherein the judg- 
ing step includes the step of also judging if the 
meta-data satisfies correctness as the predeter- 
mined data description language. 

9. The method according to claim 1 , further compris- 
ing the generation step (S3304) of generating size 
information indicating a size of the meta data 
loaded in the second loading step, and 

wherein the connection step (S3307) includes the 
step of connecting the meta-data loaded in the sec- 
ond loading step after the binary data loaded In the 
first loading step, and then connecting the size 
Information generated in the generation step after 
the meta-data. 

10. The method according to claim 9, wherein the con- 
nection step includes the step of connecting prede- 
termined data immediately after the binary data, 
and then connecting the meta-data and size infor- 
mation thereafter. 

11. The method according to claim 9, further compris- 
ing the judging step of judging If the meta-data 
loaded in the second loading step is described in a 
correct format In a predetermined data description 
language, and 

wherein the generation step and the connection 
step are executed when It is determined in the judg- 
ing step that the meta-data is described in the cor- 
rect format 

12. The method according to claim 11, wherein the 
judging step includes the step of also judging if the 
meta-data satisfies correctness as the predeter- 
mined data description language. 

13. The method according to claim 1, wherein the 
binary data is image data representing an image, 



and the meta-data represents area Information that 
pertains to the image data. 

14. The method according to claim 13, wherein the 
5 area Information indicates an area obtained by area 

segmentation. 

15. The method according to claim 14, wherein the 
area information expresses the area using polygo- 

10 nal approximation. 

16. The method according to claim 14, wherein the 
area information expresses the area using curvilin- 
ear approximation. 

15 

17. The method according to claim 14, wherein the 
area information has an area number assigned to 
an area to which pixels belong as data of corre- 
sponding pixels. 

20 

18. The method according to claim 14, wherein the 
area information has a set of binary bitmap data 
indicating the interior/exterior of a minimum rectan- 
gular region that bounds each area, and data indi- 

25 eating a position and size of the rectangle. 

19. A data processing method for discriminating meta- 
data which is registered to binary data, character- 
ized by comprising: 

30 

the loading step (S501, S1601, S2601, S3601) 
of loading data; 

the judging step (S502, S1602, S1603, S2602, 
S26G3, S3602-S3605) of Inspecting the data 
35 loaded in the loading step from an end, and 

judging If data described in a correct format in 
a predetermined data description language is 
present; and 

the discrimination step (S1604, S1605, 
40 S2604-S2608. S3607, S3608) of discriminat- 

ing meta-data included in the data on the basis 
of a judgment result in the judging step. 

20. The method according to claim 19, further comprls- 
45 ing the output step (S503) of extracting and output- 
ting the discriminated meta-data, when the meta- 
data is discriminated in the discrimination step. 

21. The method according to claim 20, wherein the out- 
so put step includes the step of displaying based on 

the extracted meta-data. 

22. The method according to claim 1 9, wherein the out- 
put step includes the step of providing the extracted 

55 meta-data to a tool for processing the predeter- 
mined data description language. 

23. The method according to claim 19, wherein the dls- 
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crimination step comprises: 

the checking step (S1 602) of checking if an end 
character string specified in the predetermined 
data description language is present at an end 
of the data; 

the search step (S1 603) of searching for a start 
character string specified In the predetermined 
data description language toward a head of the 
data, when the end character string is present; 
and 

the inspection step (Si 603) of inspecting rf 
data between the start and end character 
strings has a correct format in the predeter- 
mined data description language, when the 
start character string is found in the search 
step. 

24. The method according to claim 23, wherein the 
inspection step includes the step of also inspecting 
if correctness as the predetermined data descrip- 
tion language is satisfied. 

25. The method according to claim 19, wherein the 
judging step comprises: 

the first judging step (S2602, S2603) of 
inspecting the data loaded in the loading step 
from an end to judge if data described in a pre- 
determined format of a predetermined data 
description language is present; 
the generation step (S2605) of generating con- 
firmation information on the basis of the data 
described in the predetermined format, when ft 
is determined in the first judging step that the 
data In the predetermined format is present; 
the comparison step (S2604, S2606) of com- 
paring the confirmation information generated 
in the generation step with confirmation infor- 
mation, which is stored at a predetermined 
position of the data described in the predeter- 
mined format in the data loaded in the loading 
step; and 

the second Judging step (S2607, S2608) of 
judging based on a comparison result in the 
comparison step 'rf the data described in the 
predetermined format is meta-data. 

26. The method according to claim 25, wherein the 
generation step includes the step of generating a 
plurality of kinds of confirmation information. 

27. The method according to claim 25, wherein the 
generation step includes the step of generating 
confirmation information of a type Instructed by pre- 
determined information included in the data 
described in the predetermined format 



28. The method according to claim 25, wherein the pre- 
determined position where the confirmation Infor- 
mation is stored is immediately before the data 
described in the predetermined format. 

5 

29. The method according to claim 25, further compris- 
ing the output step of extracting and outputtlng the 
discriminated meta-data when the meta-data is dis- 
criminated in the discrimination step. 

w 

30. Trie method according to claim 29, wherein the out- 
put step Includes the step of displaying based on 
the extracted meta-data. 

is 31. The method according to claim 29, wherein the out- 
put step Includes the step of providing the extracted 
meta-data to a tool for processing the predeter- 
mined data description language. 

20 32. The method according to claim 25, wherein the first 
judging step comprises: 

the checking step of checking if an end charac- 
ter string specified in the predetermined data 
25 description language is present at an end of 

the data; 

the search step of searching for a start charac- 
ter string specified in the predetermined data 
description language toward a head of the 
30 data, when the end character string is present; 

and 

the inspection step of inspecting if data 
between the start and end character strings 
has a correct format in the predetermined data 
35 description language, when the start character 

string is found in the search step. 

33. The method according to claim 32, wherein the 
inspection step includes the step of also inspecting 

40 if correctness of the predetermined data description 
language is satisfied. 

34. The method according to claim 19, wherein the 
judging step (S3602-S3605) includes the step of 

45 extracting data stored at an end of the data loaded 
in the loading step as size information, and deter- 
mining data expressed by the size information as 
meta-data when data connected to a head of data 
having a size indicated by the size information has 

so a predetermined data value. 

35. The method according to claim 19, wherein the 
judging step comprises: 

55 the extraction step (S3604, S3605) of extract- 

ing data stored at an end of the data loaded in 
the loading step as size information, and 
extracting data corresponding to meta-data on 
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the basis of the size information, and 
the judging step includes the step of inspecting 
the data extracted in the extraction step and 
judging if data described in a predetermined 
format in a predetermined data description Ian- s 
guage Is present. 

36. The method according to claim 35, wherein the 
extraction step Includes the step of extracting data 
stored at an end of the data loaded In the loading to 
step as size information, and determining data 
expressed by the size information as meta-data 
when data connected to a head of data having a 
size indicated by the size information has a prede- 
termined data value. is 

37. The method according to claim 35, further compris- 
ing the output step of outputting the discriminated 
meta-data, when it is determined in the judging step 
that the meta-data is present. zo 

38. The method according to claim 37, wherein the out- 
put step includes the step of displaying on the basis 
of the meta-data. 

25 

39. The method according to claim 37, wherein the out- 
put step includes the step of providing the meta- 
data to a tool for processing the predetermined data 
description language. 

30 

40. The method according to claim 35, wherein the 
judging step includes the step of also Inspecting if 
correctness as the predetermined data description 
language is satisfied. 

35 

41. The method according to claim 19, wherein the 
data loaded in the loading step includes image 
data, and the meta-data represents area informa- 
tion of the image data. 

40 

42. The method according to claim 41 , wherein the dis- 
crimination step includes the step of discriminating 
area information included in the data by inspecting 
the data loaded In the loading step from an end to 
check If partial data bounded by predetermined 45 
identifiers specified in the predetermined data 
description language is present. 

43. The method according to claim 42, further compris- 
ing the output step of extracting and outputting the so 
discriminated area information when the area infor- 
mation is discriminated In the discrimination step. 

44. The method according to claim 43, wherein the out- 
put step includes the step of displaying based on 55 
the extracted area information. 

45. The method according to claim 43, wherein the out- 



put step Includes the step of providing the extracted 
area information to a predetermined processing 
tool that uses the area information. 

46. The method according to claim 44, wherein the dis- 
crimination step comprises: 

the checking step of checking if a first identifier 
is present at an end of the data to be proc- 
essed; and 

the search step of searching for a second iden- 
tifier toward a head of the data to be processed 
when the first identifier is present, and 
the discrimination step includes the step of dis- 
criminating data bounded by the first and sec- 
ond Identifiers as the area information when 
the second Identifier is found in the search 
step. 

47. The method according to claim 42, wherein the 
area information indicates an area obtained by area 
segmentation. 

48. The method according to claim 47, wherein the 
area information expresses the area using polygo- 
nal approximation. 

49. The method according to claim 47, wherein the 
area information expresses the area using curvilin- 
ear approximation. 

50. The method according to claim 47, wherein the 
area Information has an area number assigned to 
an area to which pixels belong as data of corre- 
sponding pixels. 

51. The method according to claim 47, wherein the 
area information has a set of binary bitmap data 
indicating the interior/exterior of a minimum rectan- 
gular region that bounds each area, and data indi- 
cating a position and size of the rectangle. 

52. The method according to claim 1, wherein the 
binary data Includes one of Image data, audio data, 
and moving Image data. 

53. The method according to claim 1, wherein the pre- 
determined data description language has a 
description format of one of XML, SGML, HTML, 
and SVG. 

54. The method according to claim 4, wherein the con- 
firmation information includes a check sum using at 
least one of the number of bytes, the number of 
characters, the number of words, the number of 
lines, and the number of items of meta-data. 

55. The method according to claim 9, wherein the size 
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information expresses the size by one of a Itttle- 
endian binary format, blg-endlan binary format, and 
text format 

56. The method according to claim 9, wherein the size s 
information expresses the size by the same format 

as meta-data. 

57. The method according to claim 19, wherein the 
binary data includes one of image data, audio data, 10 
and moving image data. 

58. The method according to claim 19, wherein the pre- 
determined data description language has a 
description format of one of XML, SGML, HTML, w 
and SVG. 

59. The method according to claim 25, wherein the 
confirmation information includes a check sum 
using at least one of the number of bytes, the 20 
number of characters, the number of words, the 
number of lines, and the number of items of meta- 
data. 

60. The method according to claim 34, wherein the size 25 
information expresses the size by one of a little- 
endian binary format, big-endian binary format, and 
text format 

61 . The method according to claim 34, wherein the size 30 
information expresses the size by the same format 

as meta-data. 

62. The method according to claim 35, wherein the size 
information expresses the size by one of a little- 35 
endian binary format, big-endian binary format, and 
text format 

63. The method according to claim 35, wherein the size 
information expresses the size by the same format 40 
as meta-data. 

64. A data processing apparatus for registering meta- 
data to binary data, characterized by comprising: 

45 

first loading means (S1301 , S2301, S3301) for 
loading binary data to which meta-data is to be 
appended; 

second loading means (S1302, S2302, S3302) 
for loading meta-data to be appended to the so 
binary data; 

connection means (S1304, S2306, S3306) for 
connecting the meta-data loaded by said sec- 
ond loading means after the binary data loaded 
by said first loading means; and 55 
output means (S1306, S2307, S3307) for out- 
putting data obtained by said connection 
means as a single file as a whole. 



65. The apparatus according to claim 64, further com- 
prising generation means (S2304) for generating 
confirmation information on the basis of the meta- 
data loaded by said second loading means, and 
wherein said connection means (S2306) connects 
the confirmation information generated by said gen- 
eration means and the meta-data loaded by said 
second loading means after the binary data loaded 
by said first loading means. 

66. The apparatus according to claim 64, further com- 
prising generation means (S3304) for generating 
size Information indicating a size of the meta data 
loaded by said second loading means, and 
wherein said connection means (S3307) connects 
the meta-data loaded by said second loading 
means after the binary data loaded by said first 
loading means, and then connects the size informa- 
tion generated by said generation means after the 
meta-data. 

67. The apparatus according to claim 64, wherein the 
binary data is image data representing an image, 
and the meta-data represents area information that 
pertains to the image data. 

68. A data processing apparatus for discriminating 
meta-data which is registered to binary data, char- 
acterized by comprising: 

loading means (S501, SI 601, S2601, S3601) 
for loading data; 

Judging means (S502, SI 602, S1603, S2602, 
S2603, S3602-3605) for Inspecting the data 
loaded by said loading means from an end, and 
Judging if data described in a correct format In 
a predetermined data description language is 
present; and 

discrimination means (S1604, S1605, 
S2604-S2608, S3607, S3608) for discriminat- 
ing meta-data included in the data on the basis 
of a Judgment result of said Judging means. 

69. The apparatus according to claim 68, wherein said 
judging means comprises: 

first judging means (S2602, S2603) for inspect- 
ing the data loaded by said loading means from 
an end to judge If data described in a predeter- 
mined format of a predetermined data descrip- 
tion language is present; 
generation means (S2605) for generating con- 
firmation information on the basis of the data 
described in the predetermined format, when 
said first judging means determines that the 
data in the predetermined format is present; 
comparison means (S2604, S2606) for com- 
paring the confirmation information generated 
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by said generation means with confirmation 
Information, which is stored at a predetermined 
position of the data described in the predeter- 
mined format in the data loaded by said loading 
means; and 5 
second judging means (S2607, S2608) for 
judging based on a comparison result of said 
comparison means if the data described in the 
predetermined format is meta-data. 

10 

70. The apparatus according to claim 68, wherein said 
judging means (S3602-S3605) extracts data 
stored at an end of the data loaded In the loading 
means as size information, and determines data 
expressed by the size Information as meta-data is 
when data connected to a head of data having a 
size indicated by the size information has a prede- 
termined data value. 

71. The apparatus according to claim 68, wherein said 20 
judging means comprises: 

extraction means (S3604, S3605) for extracting 
data stored at an end of the data loaded by said 
loading means as size information, and extract- 25 
ing data corresponding to meta-data on the 
basis of the size information, and 
the judging means inspects the data extracted 
by said extraction means and judges if data 
described in a predetermined format in a pre- 30 
determined data description language is 
present 

72. The apparatus according to claim 68, wherein the 
data loaded by said loading means includes image 35 
data, and the meta-data represents area informa- 
tion of the image data. 

73. A storage medium for storing a control program for 
making a computer implement a data process for 40 
registering meta-data in binary data, said control 
program characterized by comprising: 

a code of the first loading step (S1301 , S2301, 

S3301) of loading binary data to which meta- 45 

data is to be appended; 

a code of the second loading step (S1302, 

S2302, S3302) of loading meta-data to be 

appended to the binary data; 

a code of the connection step (S1304, S2036, so 

53306) of connecting the meta-data loaded in 
the second loading step after the binary data 
loaded in the first loading step; and 

a code of the output step (S1306, S2307, 

53307) of outputting data obtained in the con- ss 
nectlon step as a single file as a whole. 

74. The medium according to claim 73, wherein said 



control program further comprises a code of the 
generation step (S2304) of generating confirmation 
information on the basis of the meta-data loaded in 
the second loading step, and 

the connection step (S2306) includes the step 
of connecting the confirmation information gen- 
erated in the generation step and the meta- 
data loaded in the second loading step after the 
binary data loaded in the first loading step. 

75. The medium according to claim 73, wherein said 
control program further comprises a code of the 
generation step (S3304) of generating size informa- 
tion indicating a size of the meta data loaded in the 
second loading step, and 

the connection step (S3307) includes the step 
of connecting the meta-data loaded in the sec- 
ond loading step after the binary data loaded in 
the first loading step, and then connecting the 
size information generated in the generation 
step after the meta-data. 

76. The medium according to claim 73, wherein the 
binary data Is image data representing an image, 
and the meta-data represents area information that 
pertains to the image data. 

77. A storage medium for storing a control program for 
making a computer implement a data process for 
discriminating meta-data which is registered to 
binary data, said control program characterized by 
comprising: 

a code of the loading step (S501, S1601, 
S2601, S3601) of loading data; 
a code of the judging step (S502, S1602, 
S1603, S2602, S2603, S36O2~S3605)of 
inspecting the data loaded in the loading step 
from an end, and judging if data described in a 
correct format in a predetermined data descrip- 
tion language is present; and 
a code of the discrimination step (S1604, 
S1605, S2604-S260B, S3607, S3608) of dis- 
criminating meta-data included in the data on 
the basis of a judgment result in the judging 
step. 

78. The medium according to claim 77, wherein the 
code of the judging step comprises: 

a code of the first judging step (S2602, S2603) 
of inspecting the data loaded In the loading 
step from an end to judge if data described in a 
predetermined format of a predetermined data 
description language is present; 
a code of the generation step (S2605) of gener- 
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atlng confirmation Information on the basis of 
the data described In the predetermined for- 
mat, when it is determined in the first judging 
step that the data in the predetermined format 
is present; 5 
a code of the comparison step (S2604, S2606) 
of comparing the confirmation information gen- 
erated in the generation step with confirmation 
information, which is stored at a predetermined 
position of the data described in the predeter- 10 
mined format in the data loaded in the loading 
step; and 

a code of the second judging step (S2607, 
S2608) of judging based on a comparison 
result in the comparison step if the data 75 
described in the predetermined format is meta- 
data. 



lated using the number of words included in the 
meta-data. 

85. The method according to claim 4, wherein the con- 
firmation information includes check sum calcu- 
lated using the number of lines included In the 
meta-data. 

86. The method according to claim 4, wherein the con- 
firmation Information Includes check sum calcu- 
lated using the number of items bounded by tags 
included in the meta-data. 

87. The method according to claim 25, wherein the 
confirmation information includes check sum calcu- 
lated using the number of bytes included in the 
meta-data. 



79. The medium according to claim 77, wherein the 
Judging step (S3602-S3605) includes the step of 20 
extracting data stored at an end of the data loaded 

in the loading step as size information, and deter- 
mining data expressed by the size information as 
meta-data when data connected to a head of data 
having a size indicated by the size information has 25 
a predetermined data value. 

80. The medium according to claim 77, wherein the 
code of the judging step includes: 

30 

a code of the extraction step (S3604, S360S) of 
extracting data stored at an end of the data 
loaded In the loading step as size Information, 
and extracting data corresponding to meta- 
data on the basis of the size information; and 35 
a code of the step of inspecting the data 
extracted in the extraction step and judging if 
data described in a predetermined format in a 
predetermined data description language is 
present. 40 

81. The medium according to claim 77, wherein the 
data loaded in the loading step includes image 
data, and the meta-data represents area informa- 
tion of the Image data. 45 

82. The method according to claim 4, wherein the con- 
firmation information includes check sum calcu- 
lated using the number of bytes included in the 
meta-data. so 

83. The method according to claim 4, wherein the con- 
firmation information includes check sum calcu- 
lated using the number of characters included in the 
meta-data. 55 

84. The method according to claim 4, wherein the con- 
firmation information includes check sum calcu- 



88. The method according to claim 25, wherein the 
confirmation information Includes checksum calcu- 
lated using the number of characters included in the 
meta-data. 

89. The method according to claim 25, wherein the 
confirmation Information includes check sum calcu- 
lated using the number of words included in the 
meta-data 

90. The method according to claim 25, wherein the 
confirmation information Includes check sum calcu- 
lated using the number of lines included in the 
meta-data 

91. The method according to claim 25, wherein the 
confirmation information includes check sum calcu- 
lated using the number of items bounded by tags 
included in the meta-data. 

92. A method of formatting data files including a header 
portion, a data portion and a descriptive information 
portion containing descriptive information associ- 
ated with data contained in the data portion, the 
method comprising appending the data portion to 
the header portion and appending the descriptive 
information portion to the data portion. 

93. A computer programme comprising processor 
implementable instructions for carrying out a 
method as claimed In any one of claims 1 to 63 and 
81 to 92. 

94. A signal carrying processor implementable instruc- 
tions for carrying out a method as claimed in any 
one of claims 1 to 63 and 81 to 92. 
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